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Abstract 

Consider the Cayley graph of the cyclic group of prime order q 
with k uniformly chosen generators. For fixed k, we prove that the 
diameter of said graph is asymptotically (in q) of order fyq. 

The same also holds when the generating set is taken to be a 
symmetric set of size 2k. 

1 Introduction 

Let G be a finite group. Let S be a subset of G. The (directed) Cayley graph 
of G (w.r.t. S) is a graph (V, E) with V = G and (g, h) 6 E if and only if 
hr x g G S. The elements of S are then called generators and S the generating 
set. If S is symmetric (w.r.t. inversion) then the resulting Cayley graph is 
essentially undirected - if (g, h) is in E then so is (h,g). 

A "random random walk" on a group G is a random walk on the Cayley 
graph of G, with a generating set chosen randomly in some fashion. The 
random walk itself may be simple, each edge having equal probability at 
each step, or not simple, with some nonuniform distribution on the edges. 
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Usually, the generating set is chosen uniformly from all sets of some prefixed 
size k. 

Various aspects, most notably the typical mixing time, of random ran- 
dom walks on different finite groups have been studied. (See [H [5] for some 
examples, and [2] which gives a comprehensive survey). The results usually 
refer to Abelian groups in varying degrees of generality, from cyclic groups 
up to general finite Abelian groups. 

Roichman ([6]) notes that the diameter of the random Cayley graph is 
bounded by a constant times the mixing time, and applies this bound to the 
case of general groups of order n and k = \}og a n\ . The resulting bound is 

log fc n, which is proved to be tight for the case of Abelian groups. 

For the cyclic group Z n , Hildebrand [3] proved that the mixing time is 
of order n 2 ^ k ~ 1 \ and therefore, this is also a bound on the diameter of this 
random Cayley graph. However, in contrast with the results in [BJ, in this 
case the diameter is actually much smaller - we prove it to be 0(n l l k ). 

Note that as far as mixing times are concerned, there is no difference 
between a particular set of generators, S and the set S + c, attained by 
adding a constant c £ Z ? to all the generators in S. The diameter, however, 
might change significantly. To see this, consider, for example, a generating 
set of two elements, S = {1, [y^]} in Z q (where q is prime). The diameter 
of this Cayley graph is 2^/q. Now, observe S' = S — 1 = {0, La/^J}- The 
diameter of this Cayley graph is now q. Put another way, the diameter does 
not change when adding or removing from the generating set but the mixing 
time might change considerably. 

Another point of notice is the question of symmetric vs. asymmetric 
generating sets. The results in [3] are for asymmetric generating sets, i.e. 
S contains just the k randomly chosen generators. We might as well ask 
about the mixing times and diameters w.r.t. S = S[J(—S). The resulting 
random walk is now symmetric, which is sometimes more natural to consider. 
It seems that the results in [3], when applied to the symmetric case, would 
yield a mixing time of order n 2//fe . In contrast, the results in this paper apply 
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equally to the symmetric case. 

It should be noted that the asymptotic behavior of both the mixing time 
and the diameter, both in the symmetric and asymmetric case are the same 
as in the case of a A; dimensional tori of volume q. This is not coincidental, 
the structure of the Cayley graph of G w.r.t. S is actually that of Z fc , modulo 
some /c-dimensional lattice, which contains the lattice of all multiples of q. 
Perhaps the mixing time results of [3] could be proved in a more elementary 
manner using that perspective. 



2 Main results and open questions 

Let Z g be the cyclic group of order q, a prime number. Let gi,--,gk be k 
random generators chosen uniformly and independently from Z ? . Denote 
by Diam(q, k) the random variable which is the diameter of the resulting 
(directed) Cayley graph. The same proofs work, mutatis mutandis, for the 
diameter of the undirected Cayley graph, that is, if our generating set is taken 
to be {5-1, -g u ...,g k , -g k }. 

A simple counting argument shows that the diameter is at least Q(tfq). 
We prove that the diameter is Q(tfq) in the following sense: 

Theorem 1 For all k > 0, 

lim \im sup F(Diam(q, k) > C tyq) = 

Also, this result is tight in the sense that: 
Theorem 2 For all k > and all C , 

liminf F(Diam(q, k) > C Hq) > 

In other words, the limit behavior of the distribution of Dm ™(?< k *) j s n on- 
degenerate. This seems to hint at the following conjecture: 

Conjecture 3 converges (in distribution) to some distribution D(k) 

on R which has a non- compact support. 
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If this conjecture is true, an obvious question would be to find out what 
this limit distribution is. 

3 Proof of Theorem H 

For i 6 Z, and i = ..,ik} G {0, . . . , L} k a vector of indices, let M 
be the event %xQ\ + %292 + •• + %k9k — x (mod q). Let A X L = {Jj eL k anc ^ 
let A L = C\x€Z q ^L- We abuse the notation and identify an event with its 
indicator function. 

If Al occurs then the diameter of the Cayley graph is at most kL, while if 
Ai doesn't occur then the diameter is at least L, which is the same order of 
magnitude, since k is fixed. Therefore, to prove both theorems it is enough 
to bound P(v4c m) from above and below. 

Obviously, for any i ^ k and any x G Z 9 we have E(A?) = 1/q. 

Next we want to calculate K(A^Aj). This is the same as asking how many 
solutions, in (Z (? ) fc , are there for: 

H9i + i292 + ■■+ ik9k = x 

ji9i +3292 + ■■ +jk9k = x 

If i and j are linearly independent over Z g then the number of solutions is 
exactly q k ~ 2 . In that case K(A^Aj) = 1/q 2 and therefore the events are 
independent. If i and j are linearly dependent over Z 9 then i = Xj for some 
A / 1. In that case there are no solutions since x ^ Xx (except for x = 
which we can ignore). Therefore, in that case 

Cov(Af , Af) = E(AjAj) - E(Aj)E(Aj) = - 1/q 2 < . 

Let B X L = J2ieL* A r Notice that A% = if and only if B X L = 0. By 
linearity of expectation, 

T k 

E(Bl) = mi) = — ■ 
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Var(Af) = i(l - i) < \ and Cov(Af, AJ) < 0, giving 

Var(IT) = £ Var(Af ) + ^ ^ Cov(A? , Af) < j . 

ieL k i£L k i^jeL k 

Chebyshev's inequality now yields 

F(Bl = 0) < H\B'i - HBl)\ > E(BD) < < = j_ 

and therefore 

P(A£) = 1-P(££ = 0)>1--^ 

Let Tl = {^lAfJ be the set of all points in T, q that can be reached by 
using each generator at most L times, and let Bl = \Tl\ = Ylxez A\. Fix 
C > and let L = Ctfq. We now have P(A£) > 1 - £ = 1 - ^. Therefore 
E(-Bl) > <?(1 — ^fc)- Since B < q, we can use Markov's inequality on q — B L 
to get 

> f ) = 1 - P(g - B L > |) > 1 - f = 1 - A . 

Now if i?^ > I then for every 1 6 Z, we have T H (x — T) 7^ 0. This 
means that A^l occurs and therefore the diameter is at most 2kL. 
Therefore, 

F(Diam(q, k) > Ctfq) < P(fl (C /2k) > \) < — ^ ^ 
as required. ■ 



4 Proof of Theorem [2] 

Fix some D < 1 and let L = D ^fq. Consider the events A2 and A° L as 
previously defined. As before, if i and j are linearly independent over Z q 
then A? and A® are independent events. If i and j are linearly dependent 
then A2 and AS are in fact the same event. How many distinct events do we 
have among {A2} ieL k7 

Lemma 4 There are at least L k /2 = D k q/2 such distinct events. 
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Proof. Let i and j be linearly dependent, i.e. there exist c e Z q such that 
i = cj. In particular i = cjo (mod g) and i x = cj'i (mod g). eliminating c, 
we get = iijo (mod q). Since io,ii,jo and ji are all less than y/g (since 
k > 2) we get i ji = iijo- 

Take all i £ L k for which i and i\ are coprime in Z. If z and Zi are 
coprime and jo an d ji are coprime and ZqJi = joi\ then z = jo and ix = 
ji. Therefore, among the vectors considered above every two are linearly 
independent, so the corresponding events are distinct. 

Given L how many pairs iq, %\ < L are coprime? It is a well known fact 
(see that the fraction of coprime pairs tends to, and is always greater 
than, 6/vr 2 > 1/2. ■ 

Let / C L k be a set of index vectors such that every two are linearly 
independent and |/| = \D k q/2]L. Let X = J2i E i A j and notice that F(A° L ) > 
P(X > 0). 

Lemma 5 P(X > 0) > D k /3 

Proof. For all i £ I we have E(v4°) = i so 

D k a 1 D k 
EI) = = — 

V ; 2 g 2 

and these events are pairwise independent, so 

D k n 1 D k n D k n 1 D k 1 D fc 
V; 2 g 2 V 2 J q 2 2 V ? 2 ; 

Since X is nonnegative, from Cauchy-Schwartz inequality we get 
E(X) 2 = E(X1 X>0 ) 2 < E(X 2 )E((1 X>0 ) 2 ) = E(X 2 )P(X > 0) 
and therefore 

fix >o)> E(Jf)2 - 5: > B " 

as required. ■ 
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From lemma we get that the probability of A° L is bounded away from 

regardless of q. Next we shall show that if A° L occurs then many different 

1 yield the same member of Z q , in which case the diameter cannot be too 
small. 

Lemma 6 Let C be such that kDC k ~ l < 1. If A° D] ^ occurs then 

Diam(q, k) > C tyq . 

Proof. Let L = Djfq and let i £ {0, . . . , L} k be such that A2 occurs. If j 
and j' differ by a multiple of i then 

3i9i + 3292 + ■■+ jk9k = j[gi + f 2 92 + ■■+ f k 9k (mod q) . 
Therefore for every j £ {0, . . . , L} h there exists j' such that 

ji9i + 3292 + ■■+ 3k9k = j'i9i + 3%92 + ■■+ 3k9k (mod q) 

and j' n < D ^fq for some 1 < n < k. The number of such j' is bounded by 
kD tfq(C tfq) k ~ 1 = kDC k ~ 1 q < q. Therefore, if A° D! ^ occurs not all vertices 
are covered by combinations in {0, . . . , C ^/q k and hence the diameter is at 
least C '^fq. ■ 

To wrap up the proof, given C, let D = \/{2kC k ~ l ). From lemma [5] we 
conclude that A° Dj ^ occurs with probability at least D/3, in which case, by 
lemma [6] we get Diam(q,k) > \JCq. ■ 
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